Bayesian bi-clustering methods with applications in computational biology

نویسندگان

چکیده

Bi-clustering is a useful approach in analyzing large biological data sets when the observations come from heterogeneous groups and have number of features. We outline general Bayesian tackling bi-clustering problems moderate to high dimensions propose three models on categorical which increase complexities their modeling distributions features across bi-clusters. Our proposed methods apply wide range scenarios: situations where are cluster-distinguishable only among small subset but masked by amount noise different identified or exhibit hierarchical structures. Through simulation studies we show that our outperform existing (bi-)clustering both identifying clusters recovering feature distributional patterns further developed approaches human genetic dataset, single-cell genomic collection 1774 mouse datasets with focus 58 genes two pathways.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian methods in bioinformatics and computational systems biology

Bayesian methods are valuable, inter alia, whenever there is a need to extract information from data that are uncertain or subject to any kind of error or noise (including measurement error and experimental error, as well as noise or random variation intrinsic to the process of interest). Bayesian methods offer a number of advantages over more conventional statistical techniques that make them ...

متن کامل

Computational Systems Biology in Cancer: Modeling Methods and Applications

In recent years it has become clear that carcinogenesis is a complex process, both at the molecular and cellular levels. Understanding the origins, growth and spread of cancer, therefore requires an integrated or system-wide approach. Computational systems biology is an emerging sub-discipline in systems biology that utilizes the wealth of data from genomic, proteomic and metabolomic studies to...

متن کامل

Computational Intelligence in Clustering Algorithms, With Applications

Cluster analysis plays an important role for understanding various phenomena and exploring the nature of obtained data. A remarkable diversity of ideas, in a wide range of disciplines, has been applied to clustering research. Here, we survey clustering algorithms in computational intelligence, particularly based on neural networks and kernel-based learning. We further illustrate their applicati...

متن کامل

Kernel Methods, Multiclass Classification and Applications to Computational Molecular Biology

Support Vector Machines for pattern recognition were initially conceived for the binary classification case. A common approach to address multi-classification problems with binary classifiers is that of reducing the multiclass problem to a set of binary sub-problems, and combine their predictions in order to obtain a multiclass prediction. Reduction schemes can be represented by Error Correctin...

متن کامل

Computational methods in synthetic biology.

We discuss how a theoretical synthetic biology research programme may liberate empiricism in biological sciences beyond the unaided human brain. Because synthetic biological systems are relatively small and largely independent of evolutionary contexts, they can be represented with mathematical models strongly founded on first principles of molecular biology and laws of statistical thermodynamic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Annals of Applied Statistics

سال: 2022

ISSN: ['1941-7330', '1932-6157']

DOI: https://doi.org/10.1214/22-aoas1622